NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Open challenges and opportunities in federated foundation models towards biomedical healthcare

https://doi.org/10.1186/s13040-024-00414-9

Li, Xingyu; Peng, Lu; Wang, Yu-Ping; Zhang, Weihua (December 2025, BioData Mining)

This survey explores the transformative impact of foundation models (FMs) in artificial intelligence, focusing on their integration with federated learning (FL) in biomedical research. Foundation models such as ChatGPT, LLaMa, and CLIP, which are trained on vast datasets through methods including unsupervised pretraining, self-supervised learning, instructed fine-tuning, and reinforcement learning from human feedback, represent significant advancements in machine learning. These models, with their ability to generate coherent text and realistic images, are crucial for biomedical applications that require processing diverse data forms such as clinical reports, diagnostic images, and multimodal patient interactions. The incorporation of FL with these sophisticated models presents a promising strategy to harness their analytical power while safeguarding the privacy of sensitive medical data. This approach not only enhances the capabilities of FMs in medical diagnostics and personalized treatment but also addresses critical concerns about data privacy and security in healthcare. This survey reviews the current applications of FMs in federated settings, underscores the challenges, and identifies future research directions including scaling FMs, managing data diversity, and enhancing communication efficiency within FL frameworks. The objective is to encourage further research into the combined potential of FMs and FL, laying the groundwork for healthcare innovations.
more » « less
Full Text Available
SymBisect: Accurate Bisection for Fuzzer-Exposed Vulnerabilities

Zhang, Zheng; Hao, Yu; Chen, Weiteng; Zou, Xiaochen; Li, Xingyu; Li, Haonan; Zhai, Yizhuo; Qian, Zhiyun; Lau, Billy (August 2025, USENIX Security)

Full Text Available
AdaER: An adaptive experience replay approach for continual lifelong learning

https://doi.org/10.1016/j.neucom.2023.127204

Li, Xingyu; Tang, Bo; Li, Haifeng (March 2024, Neurocomputing)

Full Text Available
SymBisect: accurate bisection for fuzzer-exposed vulnerabilities

Zhang, Zheng; Hao, Yu; Chen, Weiteng; Zou, Xiaochen; Li, Xingyu; Li, Haonan; Zhai, Yizhuo; Qian, Zhiyun; Lau, Billy (August 2024, USENIX Security)

Full Text Available
FedLGA: Toward System-Heterogeneity of Federated Learning via Local Gradient Approximation

https://doi.org/10.1109/TCYB.2023.3247365

Li, Xingyu; Qu, Zhe; Tang, Bo; Lu, Zhuo (August 2023, IEEE Transactions on Cybernetics)

Full Text Available
Cross-View Geo-Localization via Learning Disentangled Geometric Layout Correspondence

https://doi.org/10.1609/aaai.v37i3.25457

Zhang, Xiaohan; Li, Xingyu; Sultani, Waqas; Zhou, Yi; Wshah, Safwan (June 2023, Proceedings of the AAAI Conference on Artificial Intelligence)

Cross-view geo-localization aims to estimate the location of a query ground image by matching it to a reference geo-tagged aerial images database. As an extremely challenging task, its difficulties root in the drastic view changes and different capturing time between two views. Despite these difficulties, recent works achieve outstanding progress on cross-view geo-localization benchmarks. However, existing methods still suffer from poor performance on the cross-area benchmarks, in which the training and testing data are captured from two different regions. We attribute this deficiency to the lack of ability to extract the spatial configuration of visual feature layouts and models' overfitting on low-level details from the training set. In this paper, we propose GeoDTR which explicitly disentangles geometric information from raw features and learns the spatial correlations among visual features from aerial and ground pairs with a novel geometric layout extractor module. This module generates a set of geometric layout descriptors, modulating the raw features and producing high-quality latent representations. In addition, we elaborate on two categories of data augmentations, (i) Layout simulation, which varies the spatial configuration while keeping the low-level details intact. (ii) Semantic augmentation, which alters the low-level details and encourages the model to capture spatial configurations. These augmentations help to improve the performance of the cross-view geo-localization models, especially on the cross-area benchmarks. Moreover, we propose a counterfactual-based learning process to benefit the geometric layout extractor in exploring spatial information. Extensive experiments show that GeoDTR not only achieves state-of-the-art results but also significantly boosts the performance on same-area and cross-area benchmarks. Our code can be found at https://gitlab.com/vail-uvm/geodtr.
more » « less
Full Text Available
On the Convergence of Multi-Server Federated Learning with Overlapping Area

https://doi.org/10.1109/TMC.2022.3200016

Qu, Zhe; Li, Xingyu; Xu, Jie; Tang, Bo; Lu, Zhuo; Liu, Yao (August 2022, IEEE Transactions on Mobile Computing)

Full Text Available
Generalized Federated Learning via Sharpness Aware Minimization

Qu, Zhe; Li, Xingyu; Duan, Rui; Liu, Yao; Tang, Bo; Lu, Zhuo. (January 2022, International Conference on Machine Learning)

Full Text Available
LoMar: A Local Defense Against Poisoning Attack on Federated Learning

https://doi.org/10.1109/TDSC.2021.3135422

Li, Xingyu; Qu, Zhe; Zhao, Shangqing; Tang, Bo; Lu, Zhuo; Liu, Yao (December 2021, IEEE Transactions on Dependable and Secure Computing)

Full Text Available
Learning with Instance-Dependent Label Noise: A Sample Sieve Approach

Cheng, Hao; Zhu, Zhaowei; Li, Xingyu; Gong, Yifei; Sun, Xing; Liu, Yang (May 2021, International Conference on Learning Representations)
null (Ed.)
Human-annotated labels are often prone to noise, and the presence of such noise will degrade the performance of the resulting deep neural network (DNN) models. Much of the literature (with several recent exceptions) of learning with noisy labels focuses on the case when the label noise is independent of features. Practically, annotations errors tend to be instance-dependent and often depend on the difficulty levels of recognizing a certain task. Applying existing results from instance-independent settings would require a significant amount of estimation of noise rates. Therefore, providing theoretically rigorous solutions for learning with instance-dependent label noise remains a challenge. In this paper, we propose CORES (COnfidence REgularized Sample Sieve), which progressively sieves out corrupted examples. The implementation of CORES does not require specifying noise rates and yet we are able to provide theoretical guarantees of CORES in filtering out the corrupted examples. This high-quality sample sieve allows us to treat clean examples and the corrupted ones separately in training a DNN solution, and such a separation is shown to be advantageous in the instance-dependent noise setting. We demonstrate the performance of CORES^2 on CIFAR10 and CIFAR100 datasets with synthetic instance-dependent label noise and Clothing1M with real-world human noise. As of independent interests, our sample sieve provides a generic machinery for anatomizing noisy datasets and provides a flexible interface for various robust training techniques to further improve the performance. Code is available at https://github.com/UCSC-REAL/cores.
more » « less
Full Text Available

« Prev Next »

Search for: All records